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ART-UNIT: 273 

PRIMARY- EXAMINER: Eng; David Y. 
ATTY- AGENT -FIRM: Augspurger; Lynn L. 

ABSTRACT: 

A system and method for improving microprocessor computer system out of order 
support via register management with synchronization of multiple pipelines and 
providing for processing a sequential stream of instructions in a computer system 
having a first and a second processing element, each of the processing elements 
having its own state determined by a setting of its own general purpose and control 
registers. When at any point in the processing of the sequential stream of 
instructions by the first processing element it becomes beneficial to have the 
second processing element begin continued processing of the same sequential 
instruction stream then the first and second processing elements process the 
sequential stream of instructions and may be executing the very same instruction 
but only one of said processing elements is permitted to change the overall 
architectural state of the computer system which is determined by a combination of 
the states of the first and second processing elements. The second processor will 
have more pipeline stages than the first in order processor to feed the first 
processor and reduce the finite cache penalty and increase performance. The 
processing and storage of results of the second processor does not change the 
architectural state of the computer system. Results are stored in its gprs or its 
personal storage buffer. Resynchronization of states with a coprocessor occurs upon 
an invalid op, a stall or a computed specific benefit to processing with the 
coprocessor as a speculative coprocessor. 

27 Claims, 8 Drawing figures 
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File: USPT 



Apr 18, 2000 



DOCUMENT -IDENTIFIER: US 6052771 A 

TITLE: Microprocessor with pipeline synchronization 



Detailed Description Text (28) : 

Just after the SFE enters the Running State, it will begin to execute out of order 
instructions while the uPCore continues execution at its own pace fetching its 
instructions from the cache 2 03 including those the speculative engine processing 
element SFEs storage references that are supplied to cache storage 2 03 of 
instruction and operand data in advance of their use by the uPCore conventional 
processing element. In the preferred embodiment the uPCore can be excluisively 
designed as an in order processor, or one optomized for in order processing or be 
one able to handle processing instructions of instructions where substantially less 
than 95% of all instructions will not benefit from prediction. Therefore it may 
experience a pipeline stall in the case of a LI cache miss. The SFE may continue 
past the instruction which caused the stall since it is capable of out of order 
execution. During the time that the SFE is running it generates fetch references 
that are sent to both the Instruction and Data Caches over interface 207 and to the 
Store Buffers over interface 208. A cache miss is experienced if both the Caches 
and the Store Buffers do not have the desired data. Instructions and Operands are 
returned to the SFE over interface 2 07 in the case where there is no relevant entry 
in the Store Buffer, and are returned over interface 208 if there is a relevant 
entry in the Store Buffer. SFE store references are not sent to the Instruction and 
Data Caches and are instead sent to the Store Buffer. In this way, the results of 
SFE store instructions can be made available to subsequent instructions executed on 
the SFE without altering the architectural state of the uPCore and Caches. All SFE 
stores are kept in the Store Buffers. 

Current US Original Classification (1) : 
712/34 
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FOREIGN -PAT -NO 



PUBN-DATE 



COUNTRY 



US-CL 



WO 97/14101 



April 1997 



WO 



OTHER PUBLICATIONS 



"Accelerating Your Graphics--A Diamond Multimedia White Paper" , Diamond Multimedia 
Systems, Inc., 2 pages, 1997. 

"Accelerated Graphics Port Interface Specification", Intel Corporation, Rev. 2.0, 
pp. 79-87, May, 1998. 

Datasheet, Intel 440BX AGPset : 82443BX Host Bridge/Controller, Order No. 290633- 
001, Intel Corporation, Apr., 1998. 

Diamond Multimedia Exotic Memory Whitepaper, Diamond Multimedia Systems, Inc., 8 
pages, 1997. 

"Intel Platforms for Visual Computing: A White Paper on Intel's Visual Computer 
Initiative", Intel Corporation, 8 pages, Mar., 1997. 

S3 ' s Savage3D Accelerator: A Comprehensive Architecture for Superior PC Video, 
Savage3D. white papers, 9 pages, S3 Incorporated, 1998. 

The Accelerated Graphics Port (AGP)- -A Diamond Multimedia White Paper, Diamond 
Multimedia Systems, Inc., 2 pages, 1997. 

"Write Combining Memory Implementation Guidelines", Intel Corporation, 17 pgs . , 
Nov. 1998. 

ART-UNIT: 2182 

PRIMARY- EXAMINER: Gaff in; Jeffrey 
ASSISTANT-EXAMINER: Nguyen; Tanh Q 
ATT Y- AGENT -FIRM: Carr & Ferrell LLP 



A system and method for increasing effective bus bandwidth in communicating with a 
graphics device. Graphics commands and associated parameters are written into a 
contiguous region of system memory and transmitted in a weakly ordered fashion over 
a bus to a graphics device. The graphics device reorders the incoming data into the 
same order as which the data was written into the contiguous region of system 
memory, thereby allowing the use of order dependent encoded commands with the 
weakly ordered bus interface. 

16 Claims, 15 Drawing figures 
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L24: Entry 1 of 2 



File: USPT 



Jun 11, 



2002 



DOCUMENT- IDENTIFIER: US 6405267 Bl 

TITLE: Command reordering for out of order bus transfer 
Detailed Description Text (2) : 

The present invention provides a system and a method of increasing effective bus 
bandwidth utilization by reordering out of order data received from a weakly 
ordered bus interface. FIG. 1 illustrates a computer system using the present 
invention. The computer system includes a CPU 1 which communicates with a graphics 
device over a bus 7. A graphics driver program runs on the CPU. The graphics driver 
creates drawing commands and parameters utilized in conjunction with the drawing 
commands. The driver arranges the drawing commands and parameters into 32 bit 
registers. These 32 bit registers are reserved regions of physical memory address 
space, and in the embodiment of FIG. 1 are a contiguous region A of system memory 3 
or are sequentially ordered areas of memory identifiable by address. In the 
embodiment of FIG. 1 the contiguous region A is defined as a write combining memory 
type. Further, the graphics driver sequentially writes commands and parameters into 
sequentially increasing address locations of the memory, until an upper bound is 
reached. Once the upper bound is reached, the graphics driver begins writing data, 
i.e., commands and parameters, starting at the lower bound when a command is ready 
to be written. Using the bus, the commands and parameters are communicated to the 
graphics device 5, which often includes a co-processor . 



http://westbrs:9000/bir^ 3/11/04 



Record Display Form 



Page 1 of 2 



First Hit Fwd Refs 
End of Result Set 



□ p"^^ [Print 



L24 : Entry 2 of 2 



File: USPT 



Apr 4, 2000 



US-PAT-NO: 6047367 
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TITLE: Microprocessor with improved out of order support 
DATE-ISSUED: April 4, 2000 
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NAME 
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CITY 
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NAME 

International Business Machines 
Corporation 

APPL-NO: 09/ 009828 [PALM] 
DATE FILED: January 20, 1998 
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PRIMARY -EXAMINER: Coleman; Eric 

ATT Y- AGENT -FIRM: Augspurger; Lynn L. 

ABSTRACT : 

A system and method for improving microprocessor computer system out of order 
support via register management with synchronization of multiple pipelines and 
providing for processing a sequential stream of instructions in a computer system 
having a first and a second processing element, each of the processing elements 
having its own state determined by a setting of its own general purpose and control 
registers. When at any point in the processing of said sequential stream of 
instructions by said first processing element it becomes beneficial to have the 
second processing element begin continued processing of the same sequential 
instruction stream then the first and second processing elements process the 
sequential stream of instructions and may be executing the very same instruction 
but only one of said processing elements is permitted to change the overall 
architectural state of said computer system which is determined by a combination of 
the states of said first and second processing elements. The second processor will 
have more pipeline stages than the first in order processor to feed the first 
processor and reduce the finite cache penalty and increase performance. The 
processing and storage of results of the second processor does not change the 
architectural state of the computer system. Results are stored in its gprs or its 
personal storage buffer. Resynchronization of states with a coprocessor occurs upon 
an invalid op, a stall or a computed specific benefit to processing with the 
coprocessor as a speculative coprocessor. 

11 Claims, 8 Drawing figures 
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File: USPT 



Apr 4, 2000 



DOCUMENT- IDENTIFIER: US 6047367 A 

TITLE: Microprocessor with improved out of order support 



CLAIMS : 



1. A computer system having a hierarchical memory with cache storage for 
instructions and data, comprising, 

at least one conventional processing element for processing instructions with at 
least one instruction pipeline of a defined length and defined delay per pipeline 
stage ; 

and an additional speculative engine processing element for processing 
instructions, including out-of-order instructions, for instruction sequences which 
derive finite cache improvement from out-of-order processing, wherein 

said additional speculative engine processing element and said conventional 
processing element are coupled for action in concert to process a stream of 
instructions with said conventional processing element maintaining the 
architectural state of concerted action with its pipeline handles while said 
speculative engine processing element handles speculative processes whose result 
may not change the architectural state of the computer system but which improves 
the finite cache penalty seen by said conventional processing element, and wherein 
said speculative processing element includes 

m registers in a central processing area, m being greater than a predetermined n 
number of instruction addressable GPRs identified by one or more binary fields of 
an instruction, and provision for out of order instruction execution and for 
processing a conditional branch instruction based on a branch direction guess, 

said conventional processing element being coupled to a coprocessor providing said 
additional speculative engine processing element for independent prefetching and 
execution of instructions to improve the sequence of storage references as seen by 
said conventional processing element, 

said system using a register management process enabling generation of speculative 
memory references to said storage hierarchy including said instruction and data 
cache coupled to and shared by both said additional speculative engine processing 
element and said conventional processing element, 

and further including, 

an instruction and data cache coupled to both said conventional out of order 
processing element and to said coprocessor providing said additional speculative 
engine processing element for fetching instructions and data, and an independent 
store buffer for bidirectional transfer of instructions and data in response to 
store and fetch commands of said additional speculative engine processing element. 
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11. A microprocessor having a hierarchical memory with cache storage for 
instructions and data, comprising, 

at least one conventional processing element for processing instructions with 
multiple pipelines of a defined length and defined delay per pipeline stage; 

and an additional speculative engine processing element for processing 
instructions, including out-of-order instructions, for instruction sequences which 
derive finite cache support from out-of-order processing, wherein 

said additional speculative engine processing element and said conventional 
processing element are coupled for action in concert to process a stream of 
instructions with said conventional processing element maintaining the 
architectural state of concerted action with its pipeline handles while said 
speculative- engine processing element handles speculative processes whose result 
may not change the architectural state of the computer system but which improves 
the finite cache penalty seen by said conventional processing element with register 
management and synchronization of said multiple pipelines, and further including, 

an instruction and data cache coupled to both said conventional out of order 
processing element and to said coprocessor providing said additional speculative 
engine processing element for fetching instructions and data, and an independent 
store buffer for bidirectional transfer of instructions and data in response to 
store and fetch commands of said additional speculative engine processing element. 
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Tensilica, Inc. Santa Clara CA 02 
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OTHER PUBLICATIONS 



Texas Instruments, M TMS32010 User's Guide", 1983, p. 3-7.* 

Lefugy, C. et al . , "improving Code Density using Compression Techniques", 12/97. p. 
194-204 . * 

Conte, T.M. et al . , "Instruction Fetch Mechanisms for VLIW Architectures Compressed 
Encodings", 12/96. p. 201-211. 

ART-UNIT: 282 

PRIMARY -EXAMINER: Niebling; John F. 
ASSISTANT -EXAMINER: Whitmore; Stacy 
ATT Y- AGENT -FIRM: Pillsbury Winthrop LLP 



ABSTRACT : 



A RISC processor implements an instruction set which, in addition to optimizing a 
relationship between the number of instructions required for execution of a 
program, clock period and average number of clocks per instruction, also is 
designed to optimize the equation S=IS * BI, where S is the size of program 
instructions in bits, IS is the static number of instructions required to represent 
the program (not the number required by an execution) and BI is the average number 
of bits per instruction. Compared to conventional RISC architectures, this 
processor lowers both BI and IS with minimal increases in clock period and average 
number of clocks per instruction. The processor provides good code density in a 
fixed-length high-performance encoding based on RISC principles, including a 
general register with load/store architecture. Further, the processor implements a 
simple variable- length encoding that maintains high performance. 



21 Claims, 5 Drawing figures 
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File: USPT 



Aug 28, 2001 



DOCUMENT- IDENTIFIER: US 62 82 63 3 Bl 
TITLE: High data density RISC processor 



Detailed Description Text (56) : 

This is a new variant of the condition code -based compare and branch found in many 
earlier instruction sets, as discussed above. Earlier instruction sets have 
multiple shared mult i -bit condition codes between the processor and its 
coprocessors (e.g., the PowerPC) and used multiple per -coprocessor single-bit 
condition codes (e.g., MIPS). The preferred embodiment of the present invention 
uses multiple shared single-bit condition codes. 
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ART-UNIT: 277 

PRIMARY-EXAMINER: Ramirez; Ellis B. 

ATTY- AGENT -FIRM: Skjerven, Morrill, MacPherson, Franklin & Friel LLP Koestner; Ken 
J. 



ABSTRACT : 

A multiprocessor computer system includes a plurality of processors, called 
asymmetric processors, having mutually dissimilar control and data-handling 
characteristics. The asymmetric processors are controlled by a single operating 
system although the individual processors have instruction sets that are mutually 
independent of the other processors. The multiprocessor computer system uses a 
multiprocessor architectural definition of interrupt and exception handling in 
which a processor, called a data or vector processor, having a large machine state 
and a large data width detects exceptions but defers interrupt and exception 
handling operations to another processor, called a control processor, having a 
small machine state and data width. The small machine state and small data width of 
the control processor are well suited for executing operating system programs such 
as interrupt and exception handling since control programs typically involve 
monitoring and control of individual flags and pointers. The data processor enters 
an idle state upon reset and when an exception is detected to facilitate system 
design and programming, and to simplify synchronization of the processors at system 
reset. A multiprocessor computer system includes a control processor which reads 
and writes control and status registers within a data processor. The control 
processor thus controls the operation of the data processor during execution of an 
operating system or application programs. The control processor has access to the 
control and status registers of the data processor independent of the data 
processor execution so that the same control and status registers may be accessed 
by the control processor and the data processor in parallel. 

25 Claims, 9 Drawing figures 
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TITLE: System and method for handling interrupt and exception events in an 
asymmetric multiprocessor architecture 

Detailed Description Text (18) : 

The control processor 204 starts the vector processor 206 using a STARTVP 
instruction. The STARTVP instruction causes the vector processor 2 06 to enter a 
VP. sub.-- RUN state indicating that the vector processor 2 06 is executing. STARTVP 
does nothing if the vector processor 206 is already in the VP. sub.-- RUN state. No 
result is communicated to the control processor 2 04 in response to the STARTVP 
instruction and the control processor 2 04 continues execution following STARTVP. 
The STARTVP instruction is invoked following a coprocessor data operations format 
(CDP) instruction which sets a condition code . The STARTVP instruction is invoked 
using the following assembler syntax: 

Detailed Description Text (20) : 

The control processor 204 stops the vector processor 206 using an interrupt vector 
processor (INTVP) instruction. The INTVP instruction causes the vector processor 
206 to enter a VP. sub.-- IDLE state indicating that the vector processor 2 06 is not 
executing. INTVP does nothing if the vector processor 2 06 is not executing. No 
result is communicated to the control processor 2 04 in response to the INTVP 
instruction and the control processor 204 continues execution following INTVP. The 
INTVP instruction is invoked following a coprocessor data operations format (CDP) 
instruction which sets a condition code . The INTVP instruction is executed only if 
the condition is true. Thus, the INTVP instruction is invoked using the following 
assembler syntax: 

Detailed Description Text (22) : 

The control processor 2 04 tests the operating state of the vector processor 206, 
including testing for synchronization, using a test and set (TESTSET) instruction. 
TESTSET reads a user-extended register and sets bit 30 of the register to 1, 
supplying producer/consumer type synchronization between the vector processor 2 06 
and the control processor 204. TESTSET causes the control processor 204 to stall 
until the register is transferred. The control processor 204 requests the TESTSET 
instruction following a coprocessor register transfer operations (MRC, MCR) 
instruction which sets a condition code . The TESTSET instruction is executed only 
if the condition is true. The TESTSET instruction is called using the following 
assembler syntax: 

Detailed Description Text (25) : 

For example, a MFVP instruction moves data from a vector processor scalar/special - 
purpose register to a general register in the control processor 204. The MFVP 
instruction is called subsequent to a coprocessor register transfer operations 
(MRC, MCR) instruction which sets a condition code . The MFVP instruction is executed 
only if the condition is true. The MFVP instruction is called using the following 
assembler syntax: 

Detailed Description Text (27) : 

A move to vector processor (MTVP) instruction moves data to a vector processor 
scalar/special-purpose register from a general register in the control processor 
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204. The MTVP instruction is called subsequent to a coprocessor register transfer 
operations (MRC, MCR) instruction which sets a condition code . The MTVP instruction 
is executed only if the condition is true. The MTVP instruction is called using the 
following assembler syntax: 

Detailed Description Text (31) : 

A move to extended register (MTER) instruction moves data from the control 
processor 204 to a specified extended register. The MTER instruction is called 
subsequent to a coprocessor register transfer operations (MRC, MCR) instruction 
which sets a condition code . The MTER instruction is executed only if the condition 
is true. The MTER instruction is called using the following assembler syntax: 

Detailed Description Text (33) : 

A move from extended register (MFER) instruction moves data from an extended 
register in a designated coprocessor to a register in the control processor 204. 
The MFER instruction is called subsequent to a coprocessor register transfer 
operations (MRC, MCR) instruction which sets a condition code . The MFER instruction 
is executed only if the condition is true. The MFER instruction is called using the 
following assembler syntax: 
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OTHER PUBLICATIONS 

Vora et al, ."A VLSI Implementation of the 32 -Bit Eclipse Architecture " , Proceedings 
of IEEE Int'l Conf on Computer Design (ICCD '83) pp. 311-314. 

"80286 and 80287 Programmer's Reference Manual", Intel 1987, pp. 2-14-2-16 & 3-20. 
ART-UNIT: 232 

PRIMARY -EXAMINER: Rudolph; Rebecca L. 
ATT Y - AGENT - F I RM : Milik; Kenneth L. 

ABSTRACT : 

A central processing unit (CPU) 10 comprises an external control memory for storing 
microinstructions which correspond to macroinstructions read from a system memory. 
The microinstructions are 56 bits in length and are read in 28-bit segments. CPU 10 
also comprises an internal memory management unit (MMU) 18 which comprises a 
plurality of address translation entry (ATE) registers four of which are permanent 
and sixteen of which are temporary in that the storage of a new translation entry 
occurs in a least recently used temporary translation entry register. CPU 10 also 
comprises a plurality of status register bits, some of which are settable only by 
predefined microinstructions. All of the status register bits are branchable . CPU 
10 further comprises a condition code register the state of which may be determined 
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by input signal pins. CPU 10 also comprises address generation logic which may 
generate a 24, 31 or 32 bit address upon a 32 -bit address bus. The address 
generation logic is further operable for generating a memory storage address; the 
data being supplied by external logic, such as a coprocessor. 

17 Claims, 21 Drawing figures 
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TITLE: Microprocessor having external control store 
Detailed Description Text (18) : 

In accordance with aspects of the presently preferred embodiment of the invention, 
coprocessor 3 may be coupled to the output of control memory 14 for directly 
receiving and decoding the microinstruction stream as it is fetched for execution 
by CPU 10. The coprocessor 3 may also be coupled, via CPU 10 input signal pins XCCO 
and XCC1, to CPU 10 whereby coprocessor 3 is enabled to directly set the contents 
of a condition code register within CPU 10. These aspects of the invention provide 
for a tightly coupled, synchronous CPU/coprocessor interface. CPU 10 is also 
provided with bus request and bus grant signal 1 ines which provide for a loosely 
coupled, asynchronous CPU/coprocessor interface. Furthermore, the coprocessor 3 may 
be coupled to the system data bus such that, in accordance with one aspect of the 
invention, coprocessor 3 may read data from or store data within the system memory 
2 at an address provided by CPU 10. This aspect of the invention will be discussed 
below. 

Detailed Description Text (88) : 

In accordance with one aspect of the invention the condition code type CCX enables 
the setting of the condition codes in PMR [0:1] from the input signal pins XCCO and 
XCC1. This mode of operation is especially advantageous when the CPU 10 is 
operating in conjunction with the coprocessor 3 in that it permits a tightly 
coupled interface to be formed between the CPU 10 and the coprocessor. For example, 
the coprocessor 3, by controlling the state of XCCO and XCC1, may directly affect 
the execution of a CPU 10 microinstruction, such as the Branch with Condition Code 
Check (BCC) which will be described in Table 9. 

Detailed Description Text (144) : 

For systems using MMU 18, each hardware acceleration unit may interface to CPU 10 
via the memory data bus and any or all of the five control lines: an input to CPU 
10' s wait logic, an input to CPU 10' s coprocessor trap logic, an input to CPU 10' s 
BNM-time coprocessor trap logic, and two lines to CPU 10 's external condition code 
pins . 

Detailed Description Text (145) : 

For systems having external address translation hardware, each hardware 
acceleration unit may interface to CPU 10 via the memory data bus and any or all of 
the four control lines: an input to CPU 10' s wait logic, an input to CPU 10* s BNM- 
time coprocessor trap logic, and two lines to CPU 10' s external condition code 
pins . 



http://westbrs:9000/bin/gate.exe? 3/11/04 



Record Display Form 



Page 1 of 3 



First Hit Fwd Refs 



□ Generate Collection Print 



L2 9: Entry 18 of 2 7 



File: USPT 



Feb 5, 1991 



US-PAT-NO: 4991080 

DOCUMENT- IDENTIFIER: US 4 9910 8 0 A 



TITLE: Pipeline processing apparatus for executing instructions in three streams, 
including branch stream pre -execution processor for pre -executing conditional 
branch instructions 



DATE- ISSUED: February 5, 1991 



I NVENTOR - I NFORMAT I ON : 
NAME 

Emma; Philip G. 
Pomerene; James H. 
Rechtschaf f en; Rudolph N. 
Sparacio; Frank J. 



CITY STATE 

Danbury CT 

Chappaqua NY 

Scarsdale NY 

North Bergen NJ 



ZIP CODE 



COUNTRY 



ASS IGNEE - INFORMATION : 
NAME 

International Business Machines 
Corporation 

APPL-NO: 06/ 839312 [PALM] 
DATE FILED: March 13 , 1986 

INT-CL: [05] G06F 9/38, G06F 9/28 

US-CL-ISSUED: 364/200; 364/263, 364/261.3, 364/261.5, 364/263.1, 364/262.4, 
364/261.7 

US -CL- CURRENT: 712/206; 712/235, 719/310 



FIELD-OF- SEARCH : 364/2MSFile, 3 64/9MSFile 



CITY STATE ZIP CODE COUNTRY TYPE CODE 
Armonk NY 02 



PRIOR-ART-DISCLOSED : 



U.S. PATENT DOCUMENTS 



Search Selected 




Search ALL 




Clear 



PAT-NO ISSUE-DATE PATENTEE-NAME US-CL 

□ 3559183 January 1971 Sussenguth 364/200 

□ 3707725 December 1972 Delheim 364/200 

□ 4062058 December 1977 Haynes 364/200 

□ 4155120 May 1979 Keefer et al . 364/200 



http://westbrs:9000/bin/gate.exe?f^doc&state=iinci 1 .40. 1 8&ESNAME=FRO&p_Message=... 3/1 1/04 



Record Display Form 




Page 2 of 3 



n 


4270167 


May 1981 


Koehler et al . 


364/200 


□ 


4325120 


April 1982 


Colley et al . 


364/200 


□ 


4338675 


July 1982 


Palmer et al . 


364/748 


n 


4442484 


April 1984 


Childs, Jr. et al . 


364/200 


n 


4471433 


September 1984 


Matsumoto et al. 


364/200 


□ 


4477872 


October 1984 


Losq et al. 


364/200 


D 


4547849 


October 1985 


Louie et al . 


364/200 


D 


4569018 


February 1986 


Hummel et al . 


364/200 


n 


4594651 


June 1986 


Jaswa et al . 


364/131 


n 


4597041 


June 1986 


Guyer 


364/200 


□ 


4604691 


August 1986 


Akagi 


364/200 


□ 


4608659 


August 1986 


Bradley 


364/737 


n 


4691277 


September 1987 


Kronstadt et al . 


364/200 


□ 


4701842 


October 1987 


Olnowich 


364/200 


n 


4710866 


December 1987 


Zolnowsky et al . 


364/200 


n 


4719570 


January 198 8 


Kawabe 


364/200 


n 


4725947 


February 1988 


Shonai 


364/200 


n 


4742451 


May 1988 


Bruckert et al . 


364/200 


n 


4764861 


August 1988 


Shibuya 


364/200 


n 


4819154 


April 1989 


Stif f ler 


364/200 


n 


4821183 


April 1989 


Hauris 


364/200 


n 


4827402 


May 1989 


Wada 


364/200 


n 


4858104 


August 1989 


Matsuo 


364/200 



OTHER PUBLICATIONS 

The New Riverside Dictionary by Houghton Mifflin Co., 1984, pp. 640, 641, Riverside 
Publishing Co. 

ART-UNIT: 232 

PR I MARY -EXAMINER : Eng; David Y. 
ASSISTANT-EXAMINER: Coleman; Eric 

ATT Y- AGENT -FIRM: Sughrue , Mion, Zinn, Macpeak & Seas 
ABSTRACT : 

In an approach to reducing delays resulting from resolution of conditional branch 
instructions, such instructions are pre-executed in a coprocessor which precedes a 
pipeline processor and prepared an instruction stream for input to the pipeline 
processor. Because of this pre-execution, the input instruction stream has fewer 
conditional branches for the pipeline processor to resolve. Also, the coprocessor 
may handle address generation interlock situations which also cause execution 
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delays in the pipeline processor. 
18 Claims, 17 Drawing figures 
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branch instructions 



Brief Summary Text (47) : 

When a branch is encountered, the branch stream coprocessor pre -executes it, along 
with the condition code setting instruction which usually immediately precedes it. 
The comparison between actual actions and targets and predicted actions and targets 
takes place before instructions are fed into the main stream pipeline. If 
necessary, the branch history table will be corrected. Correction takes place in 
this branch stream, rather than in the main stream, so that branch delays during 
instruction execution are reduced. Also, because the branch stream coprocessor 
scans instructions two at a time, the coprocessor will tend to stay sufficiently 
ahead of a main stream processor which actually executes the instructions. 

Drawing Description Text (15) : 

FIG. 14 is a chart showing the procedure for verifying condition codes set by the 
branch stream coprocessor . 

Detailed Description Text (59) : 

If a Test Under Mask or Compare Logical Immediate instruction is not marked as 
invalid as described above, it will proceed through the pipeline. The operand will 
be fetched from cache 101 via the paths 551 and 553, and the pre-execute step 559 
will generate the condition code and send it to a BSC (branch stream coprocessor) 
condition code and valid logic circuit 561. In this logic, an unlikely but possible 
situation is checked for. A Test Under Mask or Compare Logical Immediate 
instruction is almost always followed by a Branch on Condition or Branch on 
Condition Register, in which case the condition code is the proper one for the 
Branch on Condition or Branch on Condition Register instruction to act on. However, 
there could be an intervening instruction which could alter the condition code 
generated by the Test Under Mask or Compare Logical Immediate instruction, thereby 
rendering the generated condition code unreliable. When such a situation is 
detected, the condition code is marked invalid The logic for this procedure is 
shown in tabular form in FIG. 14. Note that three successive instructions enter 
into this logic. 

Detailed Description Text (69) : 

2. The first instruction in the next instruction pair register 703 is either a Test 
Under Mask or a Compare Logical Immediate instruction, which only set the condition 
code. Since the correct condition code value was determined by the branch stream 
coprocessor 501, and is part of entry in the ready-to-decode buffer 601, it is not 
necessary to refetch the Test Under Mask or Compare Logical Immediate operand. The 
condition code value will be carried along through the pipeline of the main stream 
processor 701 and will be used to update the main stream processor condition code 
at the proper time during execution of the Test Under Mask or Compare Logical 
Immediate instruction. 
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